Method and device for realizing Chinese character input based on uncertainty information

ABSTRACT

The present invention provides a method and device for realizing Chinese character input based on uncertainty information, wherein the method comprises: receiving input information from a user; extracting at least two types of uncertainty information of Chinese characters to be input, from the input information; and, determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese character(s). The device comprises a receiving module, an extracting module and a matching module. The method and device as provided by the present invention allow a user who has incomplete memory of pronunciation or glyph information of Chinese characters to be input to realize correct input of the Chinese characters by defining a certain range for candidate characters corresponding to the Chinese characters to be input, in combination with at least two types of the extracted uncertainty information of the Chinese characters to be input.

CROSS-REFERENCE TO RELATED PATENT APPLICATION

This application claims the benefit of Chinese Patent Application No. 201410041504.8, filed on Jan. 28, 2014, in the Chinese Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.

BACKGROUND

1. Field

The present invention relates to the technical field of terminal devices and, in particular, to a method and device for realizing Chinese character input based on uncertainty information.

2. Description of the Related Art

Chinese characters are known for their complex strokes, and people today are increasingly unfamiliar with the glyphs because of the widely used Pinyin input systems provided by mobile devices, which frequently causes character amnesia. Although handwriting input systems can list the expected, correct characters among the options for users who input incorrect characters with a similar glyph, it may sometimes be difficult to immediately recall even a rough profile of some common Chinese characters such as “

” and “

”.

According to the existing technology, apart from directly writing the correct character, some supplementary means, such as utilizing auxiliary information or exchanging prior knowledge with the system, are adopted for inputting Chinese characters.

For Chinese characters whose particular glyphs cannot be recalled, people may still remember the corresponding pronunciation and tone, the type of structural configuration, or only a certain radical of the character. Alternatively, they may remember only the pronunciation of a corresponding character and hence can only input a homophone, because the Chinese phonetic alphabet (Pinyin) system used in Hong Kong or Taiwan differs from that of the mainland, or because some elderly people never learned the Chinese phonetic alphabet. All of the above information is referred to as uncertainty information.

Prior-art input methods usually realize Chinese character input by means of a single type of information, such as the phonetic alphabet or the glyph. A user therefore cannot input a Chinese character based on incomplete memory of the character or of its phonetic alphabet information.

SUMMARY

The present invention provides a method and device for realizing Chinese character input based on uncertainty information, which can solve the problem that Chinese characters are difficult to input correctly when the user has incomplete memory of their phonetic alphabet or glyph information, by means of at least two types of uncertainty information provided by the user who is inputting the Chinese characters.

In order to solve the above problem existing in the prior art, the embodiments of the present invention, in one aspect, provide a method for realizing Chinese character input based on uncertainty information, comprising steps of:

receiving input information from a user;

extracting at least two types of uncertainty information of Chinese characters to be input, from the input information; and

determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters.

The embodiments of the present invention, in another aspect, provide a method for realizing Chinese character input based on uncertainty information, comprising steps of:

receiving input information from a user;

extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of the following information:

tone information, structural configuration information, radical information, homophone information, and location information; and,

determining and outputting the Chinese characters to be input based on the determined Chinese characters and the uncertainty information.

The embodiments of the present invention, in yet another aspect, provide a device for realizing Chinese character input based on uncertainty information, comprising a receiving module, an extracting module and a matching module:

the receiving module is configured for receiving input information from a user;

the extracting module is configured for extracting at least two types of uncertainty information of Chinese characters to be input, from the input information; and

the matching module is configured for determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters.

The embodiments of the present invention, in still another aspect, provide a device for realizing Chinese character input based on uncertainty information, comprising a receiving module, an extracting module and a matching module:

the receiving module is configured for receiving input information from a user;

the extracting module is configured for extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of the following information:

tone information, structural configuration information, radical information, homophone information, and location information; and,

the matching module is configured for determining and outputting the Chinese characters to be input based on the determined Chinese characters and the uncertainty information.

The method and device provided by the present invention allow a user who has incomplete memory of the pronunciation or glyph information of the Chinese characters to be input to input the characters correctly, by defining a certain range of candidate characters corresponding to the Chinese characters to be input, in combination with at least two types of extracted uncertainty information. In addition, the above solutions make only minor modifications to existing systems and hence do not affect system compatibility. Moreover, the implementations of these solutions are both simple and highly effective.

Further aspects and advantages of the present invention will be appreciated and become apparent from the descriptions below, or will be learned from the practice of the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and/or further aspects and advantages of the present invention will become apparent and be well understood from the following descriptions of embodiments, with reference to the accompanying drawings, in which:

FIG. 1 is a flowchart illustrating a method for realizing Chinese character input based on uncertainty information according to an embodiment of the present invention;

FIG. 2 is a flowchart illustrating a method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention;

FIG. 3 is a schematic diagram illustrating the category of the structural configuration information in a method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention;

FIG. 4 is a flowchart of solution 1 in the method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention;

FIG. 5 is a flowchart of solution 4 in the method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention;

FIG. 6 is a structural diagram illustrating a device for realizing Chinese character input based on uncertainty information according to an embodiment of the present invention;

FIG. 7 is a structural diagram illustrating a device for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention; and

FIG. 8 is a schematic diagram illustrating a process of determining Chinese characters with the highest matching degree based on uncertainty information.

DETAILED DESCRIPTION OF THE INVENTION EXEMPLARY EMBODIMENTS

Hereinafter the embodiments of the present invention will be described in detail. Examples of these embodiments are illustrated in the drawings, in which identical or similar reference numerals refer throughout to identical or similar elements or to elements having identical or similar functions. The examples described with reference to the drawings are illustrative, serve only to explain the present invention, and shall not be regarded as limiting it.

It should be appreciated by the person skilled in the art that, unless particularly specified, the singular forms “one”, “a (an)”, “the (said)” and “this (that)” used herein also cover the plural forms. It should be further understood that the wording “include (comprise)” used in the description refers to the existence of the corresponding features, integers, steps, operations, elements and/or components without excluding the possibility of existing or incorporating one or more other features, integers, steps, operations, elements, components and/or groups thereof. It should be realized that when one element is defined to be “connected” or “coupled” to another element, it can be connected or coupled to the other element directly or via an intermediate element. In addition, the “connecting” or “coupling” used herein may include wireless connecting or coupling. The wording “and/or” used herein includes any one of, or all combinations of, one or more of the related items listed.

It should be appreciated by the person skilled in the art that all the terms used herein (including technical terms and scientific terms), unless otherwise specified, have the general meanings well known to those skilled in the art to which the present invention pertains. It should also be understood that terms such as those defined in general dictionaries have meanings consistent with the context of the prior art, and shall not be interpreted in an excessively idealized or formal sense unless so specified herein.

It should be appreciated by the person skilled in the art that the “UE” and “terminal” used herein include both devices provided with only a radio signal receiver incapable of transmitting and devices provided with hardware capable of receiving and transmitting for bidirectional communication on two-way communication links. Such a device may include: a cellular or other communication device with or without a multi-line display; a PCS that may incorporate functions of speech and data processing as well as facsimile and/or data communication; a PDA that may comprise an RF receiver, a pager, Internet/Intranet access, a web browser, a notepad, a calendar and/or a GPS receiver; and/or a conventional, laptop or palmtop computer or other device provided with an RF receiver. The “UE” and “terminal” used herein may be handheld, transportable, installable in (aero, marine and/or land) communication media, or adapted and/or configured to operate locally and/or in a distributed manner at any other location on the earth or in space. The “UE” and “terminal” used herein may also be a communication terminal, an internet terminal or a music/video player terminal, such as a PDA, a MID (Mobile Internet Device) and/or a mobile phone with music/video playback functions. The “UE” and “terminal device” used herein may also be devices such as a smart television and a set top box.

FIG. 1 is a flowchart illustrating a method for realizing Chinese character input based on uncertainty information according to an embodiment of the present invention. As shown in FIG. 1, the method for realizing Chinese character input based on uncertainty information in the present embodiment comprises the steps below.

Step 101, receiving input information from a user.

As an embodiment of the present invention, the step of receiving input information from a user comprises:

detecting a user operation within an input area; and

recording at least two types of uncertainty information input by the user operation.

Step 102, extracting at least two types of uncertainty information of Chinese characters to be input, from the input information.

Specifically, the uncertainty information includes any of the following types of information: phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information. The phonetic alphabet information includes: phonetic alphabet character string information, used for separating phonetic alphabet strings according to the calligraphic characteristics and writing area of the handwriting; and phonetic alphabet tone information, used for separating phonetic alphabet tones according to the calligraphic characteristics and writing area of the handwritten tone. The structural configuration information is used for separating the structural configuration according to the calligraphic characteristics and writing area of the structural configuration information. The radical information is used for separating the radicals that the user is able to determine, according to the calligraphic characteristics and writing area of the structural configuration information. The homophone information is used for separating the homophone according to the calligraphic characteristics and writing area of the handwritten homophone.
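By way of illustration only, the information types listed above may be held in a simple record so that later steps can check how many types the user actually supplied. The following Python sketch is not part of the described embodiments; the class name `UncertaintyInfo`, its field names and the method `provided_types` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class UncertaintyInfo:
    """Hypothetical record of the uncertainty information types (names assumed)."""
    pinyin: Optional[str] = None      # phonetic alphabet string, e.g. "jiao"
    tone: Optional[int] = None        # 1 to 4
    structure: Optional[str] = None   # e.g. "left-right"
    radical: Optional[str] = None     # a radical the user remembers
    homophone: Optional[str] = None   # a character with the same pronunciation
    location: Optional[int] = None    # 0-based position within the phrase

    def provided_types(self) -> List[str]:
        """Return the names of the fields the user actually filled in."""
        return [name for name, value in vars(self).items() if value is not None]


info = UncertaintyInfo(pinyin="jiao", tone=2)
print(info.provided_types())
```

A caller could, for example, require `len(info.provided_types()) >= 2` before starting the matching step.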

The tone information includes: information about the level tone, rising tone, falling-rising tone, or falling tone. As an embodiment of the present invention, a terminal device receives the tone information in any one of the following ways:

receiving tone symbols input by a user through handwriting corresponding to the tone information, the tone symbols including ˉ, ´, ˇ or `; and

receiving a certain number or letter indicative of the tone information.

For example, numbers 1, 2, 3 and 4 can be preset to represent the level tone, the rising tone, the falling-rising tone and the falling tone, respectively; or letters A, B, C and D can be preset to represent the level tone, the rising tone, the falling-rising tone and the falling tone, respectively.

As an embodiment of the present invention, a terminal device receives the location information by receiving a blank or a certain character input by the user. For example, a question mark “?” can be preset to indicate the location information.

In practical implementation, including the location information of the Chinese characters to be input facilitates positioning them. For example, for a user who has no idea how to write the third character “
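The preset correspondence between numbers or letters and tones may be sketched, for illustration only, as a lookup table. The names `TONE_NAMES`, `SYMBOL_TO_TONE` and `parse_tone` are hypothetical, and accepting lowercase letters is an added assumption.

```python
# Tone names in the order given in the description (tones 1 to 4).
TONE_NAMES = {1: "level", 2: "rising", 3: "falling-rising", 4: "falling"}

# Preset mapping: digits 1-4 and letters A-D both denote tones 1-4.
SYMBOL_TO_TONE = {
    "1": 1, "2": 2, "3": 3, "4": 4,
    "A": 1, "B": 2, "C": 3, "D": 4,
}


def parse_tone(token: str) -> str:
    """Map a preset number or letter to its tone name (case-insensitive by assumption)."""
    key = token.strip().upper()
    if key not in SYMBOL_TO_TONE:
        raise ValueError(f"not a preset tone symbol: {token!r}")
    return TONE_NAMES[SYMBOL_TO_TONE[key]]


print(parse_tone("2"))
print(parse_tone("C"))
```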

” in the phrase “

”, he/she can consecutively input “

?

” by replacing the third character “

” with a question mark or a space. With the aid of the radical information and homophone information in the uncertainty information, he/she can input the correct Chinese characters even with incomplete memory of the pronunciation or glyph information of the Chinese characters to be input.

Step 103, determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters.

As an embodiment of the present invention, the step of determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters comprises:
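As a minimal sketch of how a placeholder may convey the location information, the following Python function finds the first placeholder in the input and returns its position together with the known characters before and after it. The function name `split_location` and the set `PLACEHOLDERS` are hypothetical; Latin letters stand in for the known characters of the phrase.

```python
from typing import Tuple

# Characters treated as a location placeholder (a blank or a preset question mark).
PLACEHOLDERS = {"?", "？", " "}


def split_location(text: str) -> Tuple[int, str, str]:
    """Return (0-based index of the placeholder, preceding context, following context)."""
    for index, ch in enumerate(text):
        if ch in PLACEHOLDERS:
            return index, text[:index], text[index + 1:]
    raise ValueError("input contains no location placeholder")


# "AB?D": the unknown character is the third one, preceded by "AB" and followed by "D".
print(split_location("AB?D"))
```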

determining a priority for the uncertainty information;

matching the Chinese characters to be input, in a sequence based on the priority, according to the uncertainty information in the corresponding models; and,

determining the Chinese characters with the highest matching degree as the Chinese characters to be input, and outputting the Chinese characters with the highest matching degree.

In the present application scenario, the steps of receiving input information from a user, extracting at least two types of uncertainty information of Chinese characters to be input from the input information, and determining and outputting the matched Chinese characters according to the at least two types of uncertainty information allow a user who has only incomplete memory of the pronunciation or glyph information of the Chinese characters to be input to input them correctly, by defining a certain range of candidate characters corresponding to the Chinese characters to be input, in combination with at least two types of extracted uncertainty information. The uncertainty information includes, for example, phonetic alphabet information, tone information, structural configuration information, radical information, homophone information, and location information.
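Purely as an illustrative sketch, the three sub-steps above may be approximated by filtering a candidate list one information type at a time, in priority order. The function `match_by_priority`, the toy lexicon and the candidate identifiers `C1` to `C3` are hypothetical, and skipping a filter that would eliminate every candidate is an added assumption rather than a feature of the described method.

```python
from typing import Dict, List

Candidate = Dict[str, object]


def match_by_priority(candidates: List[Candidate],
                      info: Dict[str, object],
                      priority: List[str]) -> List[Candidate]:
    """Narrow the candidates by each provided information type, highest priority first."""
    remaining = list(candidates)
    for key in priority:
        if key not in info:
            continue  # the user did not provide this type
        narrowed = [c for c in remaining if c.get(key) == info[key]]
        if narrowed:  # assumption: ignore a constraint that rules out everything
            remaining = narrowed
    return remaining


# Toy lexicon with placeholder identifiers instead of real characters.
LEXICON = [
    {"char": "C1", "pinyin": "jiao", "tone": 2, "structure": "left-right"},
    {"char": "C2", "pinyin": "jiao", "tone": 4, "structure": "left-right"},
    {"char": "C3", "pinyin": "qiao", "tone": 2, "structure": "up-down"},
]

result = match_by_priority(LEXICON, {"pinyin": "jiao", "tone": 2}, ["pinyin", "tone"])
print([c["char"] for c in result])
```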

FIG. 2 is a flowchart illustrating a method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention. As shown in FIG. 2, the method for realizing Chinese character input based on uncertainty information in the present embodiment comprises the steps below.

Step 201, receiving input information from a user.

As an embodiment of the present invention, the step of receiving input information from a user comprises:

detecting a user operation within an input area; and

recording at least two types of uncertainty information input by the user operation.

Step 202, receiving the input of the user and setting a priority for the uncertainty information.

Step 203, extracting at least two types of uncertainty information of Chinese characters to be input, from the input information.

Specifically, the uncertainty information includes any of the following types of information: phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information. The phonetic alphabet information includes: phonetic alphabet character string information, used for separating phonetic alphabet strings according to the calligraphic characteristics and writing area of the handwriting; and phonetic alphabet tone information, used for separating phonetic alphabet tones according to the calligraphic characteristics and writing area of the handwritten tone. The structural configuration information is used for separating the structural configuration according to the calligraphic characteristics and writing area of the structural configuration information. The radical information is used for separating the radicals that the user is able to determine, according to the calligraphic characteristics and writing area of the structural configuration information. The homophone information is used for separating the homophone according to the calligraphic characteristics and writing area of the handwritten homophone.

FIG. 3 is a schematic diagram illustrating the category of the structural configuration information in a method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention. As shown in FIG. 3, the structural configuration information involved in the present invention can be categorized as follows: based on structural characteristics, Chinese characters can generally be categorized into several types in terms of structural configuration, including: sole structure, up-down structure, up-middle-down structure, left-right structure, left-middle-right structure, up-sole structure, down-sole structure, left-sole structure, right-sole structure, half-enclosed structure, enclosed structure, and a structure formed of three identical symbols arranged in a pyramid.

The “solutions corresponding to the combination of uncertainty information” involved in the present application scenario will be described in detail below.

There may be the following eight solutions for input, according to the types of uncertainty information that a user can provide. The required Chinese characters are more likely to be obtained quickly and accurately if more information is input into the system and the system thus acquires more prior knowledge.

For clarity of description, mapping relations between solution, phonetic alphabet, phonetic alphabet tone, structural configuration, radical and homophone can be established, as shown in Table 1.
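For illustration, the twelve structural configuration types listed above may be represented as an enumeration so that a recognized structure mark can be stored as a single value. The class name `Structure` and its member names are hypothetical; only the type names come from the description.

```python
from enum import Enum


class Structure(Enum):
    """Structural configuration types as listed for FIG. 3 (identifier names assumed)."""
    SOLE = "sole"
    UP_DOWN = "up-down"
    UP_MIDDLE_DOWN = "up-middle-down"
    LEFT_RIGHT = "left-right"
    LEFT_MIDDLE_RIGHT = "left-middle-right"
    UP_SOLE = "up-sole"
    DOWN_SOLE = "down-sole"
    LEFT_SOLE = "left-sole"
    RIGHT_SOLE = "right-sole"
    HALF_ENCLOSED = "half-enclosed"
    ENCLOSED = "enclosed"
    TRIPLE_PYRAMID = "three-same-symbols-pyramided"


# Look up a member from its textual label.
print(Structure("left-right"))
```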

TABLE 1: Mapping relations between solution, phonetic alphabet, phonetic alphabet tone, structural configuration, radical and homophone

Solution    Phonetic alphabet    Tone    Structural configuration    Radical    Homophone
Solution 1  √
Solution 2  √ √
Solution 3  √ √
Solution 4  √ √ √
Solution 5  √ √
Solution 6  √ √ √
Solution 7  √ √
Solution 8  √ √

The above eight examples are merely common combinations that users may adopt, and many other solutions with different combinations are possible. A single type of uncertainty information can also constitute a solution on its own, although this is less reliable. For example, if the structural configuration information indicated by a “block” is used alone, it may be confused with certain radicals of Chinese characters, so that the structural configuration information, being of a single type, is mistakenly recognized as a Chinese character; furthermore, if the mistaken Chinese character happens to fit the context, the correct Chinese character may not be output in the end. Homophones used alone involve a similar problem.

It is preferable to use the radical information together with the structural configuration information, so as to better define the location of the radical within the character and hence reduce the processing complexity for the system.

Step 204, displaying the uncertainty information input by the user on the screen.

In practical application, the embodiment of the present invention differs from the prior art in that it can display the uncertainty information input by the user on the screen; for example, it is capable of displaying at least two types of uncertainty information. According to the prior art, only one type of uncertainty information, either phonetic alphabet or glyph information of the Chinese characters to be input, can be extracted. By contrast, the method of Chinese character input provided by the embodiment of the present invention is capable of providing and displaying two or more types of uncertainty information, which allows taking full advantage of the location information of a phrase. For example, when a user, who wants to input the phrase “

”, forgets how to exactly write the character “

” but remembers the following information: the location of the forgotten character “

” in the context, that is, it is the third character in the phrase; the two characters in front of the forgotten character constitute a sub-phrase in an order of “

” and “

”; and the fourth character in the phrase is “

”. Such information can be combined with other information concerning the forgotten character, such as: its phonetic alphabet is “jiao”, it is pronounced in the second tone, it has a left-right structure, it includes a radical “

” in its right portion, and one of its homophones is “

”. In this way, even if a user has only incomplete memory of the pronunciation or glyph information of the Chinese characters to be input, he/she can still accurately and conveniently input the expected Chinese characters by means of the two or more types of uncertainty information input by the user and displayed on the screen.

Further, the updated uncertainty information is displayed on the screen when a modification of the uncertainty information by the user is detected within an input area.

In practical application, the embodiment of the present invention differs from the prior art in that it can display the updated uncertainty information on the screen when a modification of the uncertainty information by the user is detected within an input area. According to the prior art, only one type of uncertainty information, either phonetic alphabet or glyph information of the Chinese characters to be input, can be extracted. By contrast, the method for Chinese character input provided by the embodiment of the present invention is capable of providing two or more types of uncertainty information and of displaying the updated uncertainty information on the screen when such a modification is detected, which allows taking full advantage of the location information of a phrase. For example, when a user, who wants to input the phrase “

”, forgets how to exactly write the character “

” but remembers the following information: the location of the forgotten character “

” in the context, that is, it is the third character in the phrase; the two characters in front of the forgotten character constitute a sub-phrase in an order of “

” and “

”; and the fourth character in the phrase is “

”. Such information can be combined with other information concerning the forgotten character, such as: its phonetic alphabet is “jiao”, it is pronounced in the second tone, it has a left-right structure, it includes a radical “

” in its right portion, and one of its homophones is “

”. In this way, even if a user has only incomplete memory of the pronunciation or glyph information of the Chinese characters to be input, he/she can still accurately and conveniently input the expected Chinese characters by means of the two or more types of uncertainty information input by the user and displayed on the screen.

Specifically, for example, when a user, who wants to input the phrase “

”, forgets how to exactly write the character “

” but remembers the following information: the location of the forgotten character “

” in the phrase, that is, it is the third character in the phrase; the two characters in front of the forgotten character constitute a sub-phrase in an order of “

” and “

”; and the fourth character in the phrase is “

”. The method can make use of the location information of the character to be input as follows:

if the uncertainty information includes two types of information, the information to be combined is that: it is the third character, and its phonetic alphabet is “jiao”;

if the uncertainty information includes three types of information, the information to be combined is that: it is the third character, its phonetic alphabet is “jiao”, and it is pronounced in the second tone; and

if the uncertainty information includes four types of information, the information to be combined is that: it is the third character, its phonetic alphabet is “jiao”, it is pronounced in the second tone, and it has a left-right structure.

As can be seen from the above, including more types of uncertainty information narrows the set of candidate characters determined from the uncertainty information as far as possible. This reduces the number of phrases formed by the candidate characters and the context, shortens the computation time needed by the language model, and thereby improves the calculation accuracy. Consequently, a Chinese character whose particular glyph or phonetic alphabet is uncertain can be input quickly and conveniently.

By displaying the updated uncertainty information on the screen when a modification of the uncertainty information by the user is detected within an input area, the method for Chinese character input provided by the embodiment of the present invention is improved in both practicality and operability.

Step 205, determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters.
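The narrowing effect described above can be illustrated with a small Python sketch that records the size of the candidate set after each additional constraint is applied. The toy lexicon, the identifiers `C1` to `C5` and the function `candidate_counts` are hypothetical and serve only to show the set shrinking as more information types are combined.

```python
from typing import Dict, List, Tuple

# Toy lexicon: placeholder identifiers mapped to (pinyin, tone, structure).
TOY_LEXICON: Dict[str, Tuple[str, int, str]] = {
    "C1": ("jiao", 2, "left-right"),
    "C2": ("jiao", 2, "up-down"),
    "C3": ("jiao", 4, "left-right"),
    "C4": ("jiao", 1, "up-down"),
    "C5": ("qiao", 2, "left-right"),
}

FIELDS = {"pinyin": 0, "tone": 1, "structure": 2}


def candidate_counts(constraints: List[Tuple[str, object]]) -> List[int]:
    """Apply the constraints cumulatively and return the candidate count after each one."""
    remaining = set(TOY_LEXICON)
    counts = []
    for field, value in constraints:
        position = FIELDS[field]
        remaining = {c for c in remaining if TOY_LEXICON[c][position] == value}
        counts.append(len(remaining))
    return counts


print(candidate_counts([("pinyin", "jiao"), ("tone", 2), ("structure", "left-right")]))
```

In this toy data the count drops from four candidates to two and then to one, so the language model would have fewer phrases to evaluate at each stage.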

As an embodiment of the present invention, the step of determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters comprises:

selecting corresponding models according to the at least two types of uncertainty information;

matching the Chinese characters to be input in at least two types of corresponding models;

analyzing the matching results of each of the at least two types of corresponding models to determine the Chinese characters with the highest matching degree as the Chinese characters to be input; and,

outputting the Chinese characters with the highest matching degree.

As an embodiment of the present invention, the method further comprises steps of:

determining a priority for the uncertainty information; and

matching the Chinese character to be input in a sequence according to the priority, in the corresponding model.

As an embodiment of the present invention, the method further comprises a step of:

receiving input information from a user, and setting a priority for the uncertainty information.

In practical application, setting a priority for the uncertainty information distinguishes the present invention from prior-art input methods; the priority affects both the efficiency and the matching effect.

Specifically, by setting a priority for the uncertainty information, the Chinese characters to be input can be found on the screen more quickly and conveniently. Moreover, different types of uncertainty information have different levels of importance in matching different Chinese characters to be input.

For example, if three types of uncertainty information, namely the location information, the phonetic alphabet information and the tone information, are utilized, a priority can be configured for these three types in advance. Specifically, it may be arranged that the location information has the highest priority and is therefore processed first; the phonetic alphabet information has the second priority and is processed second; and the tone information has the lowest priority and is processed last.
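As a sketch under stated assumptions, the steps above may be pictured as several scoring models, one per information type, whose results are combined to find the candidate with the highest matching degree. Summing the per-model scores is an assumption made here for simplicity; the description does not specify how the results are analyzed. The names `make_lookup_model` and `best_match`, the scores and the candidate identifiers are hypothetical.

```python
from typing import Callable, Dict, Iterable

Model = Callable[[str], float]


def make_lookup_model(scores: Dict[str, float]) -> Model:
    """Build a toy model that returns a stored score, or 0.0 for unknown candidates."""
    return lambda candidate: scores.get(candidate, 0.0)


def best_match(candidates: Iterable[str], models: Iterable[Model]) -> str:
    """Return the candidate whose summed score over all models is highest."""
    model_list = list(models)
    totals = {c: sum(m(c) for m in model_list) for c in candidates}
    if not totals:
        raise ValueError("no candidates to match")
    return max(totals, key=totals.get)


# Two toy models selected for two information types (scores are invented).
pinyin_model = make_lookup_model({"C1": 0.9, "C2": 0.9, "C3": 0.1})
structure_model = make_lookup_model({"C1": 0.8, "C2": 0.2, "C3": 0.8})

print(best_match(["C1", "C2", "C3"], [pinyin_model, structure_model]))
```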
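The preconfigured priority in this example may be sketched as a mapping from information type to rank, which determines the order in which the provided types are processed. The names `DEFAULT_PRIORITY` and `execution_order` are hypothetical, and placing unranked types last is an added assumption.

```python
from typing import Dict, List, Optional

# Smaller rank means higher priority: location first, then pinyin, then tone.
DEFAULT_PRIORITY: Dict[str, int] = {"location": 1, "pinyin": 2, "tone": 3}


def execution_order(provided: List[str],
                    priority: Optional[Dict[str, int]] = None) -> List[str]:
    """Sort the provided information types by rank; unranked types go last (assumption)."""
    ranks = DEFAULT_PRIORITY if priority is None else priority
    lowest = max(ranks.values(), default=0) + 1
    return sorted(provided, key=lambda name: ranks.get(name, lowest))


print(execution_order(["tone", "pinyin", "location"]))
```

A user who is more confident about pronunciation could pass a different mapping, for example `{"tone": 1, "homophone": 1, "structure": 2}`, to change the order.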

When it is expected to input the Chinese character “

” from the phrase “

”, the technical solution according to the present invention provides a user with plenty of ways to determine the character “

” to be input. For example, the user can determine the character by means of the pronunciation information, the radical information, the tone information, the structural configuration information and the location information, etc. For instance, if the uncertainty information input by the user includes: the character is left-right structured with “

” in its right portion, and the character is pronounced in the second tone; the Chinese character to be input may be “

” from phrase “

”, or may be “

” from phrase“

”. If the location information “

” is further input, it will still be difficult to distinguish “

” from “

” simply based on the information “the character to be input occupies the third position in the possible phrase” indicated by the location information; but it will be easy to exclude the candidate character “

” based on the information “

” and “

” contained in the location information.

Further, for example, when a user is very confident in finding a Chinese character through its pronunciation, he/she may preset the tone information and/or homophone information with a higher priority; when a user is better at finding a Chinese character through handwriting, he/she may preset the structural configuration information and/or radical information in the uncertainty information with a higher priority.
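The priority mechanism described above can be sketched as a sequence of filters applied in the preset order. The following is only a minimal illustration under stated assumptions: the `CharInfo` record, the five-entry `LEXICON` and the function name `match_by_priority` are hypothetical and do not come from the specification, and the rule of skipping a clue that would eliminate every candidate is one possible policy rather than the claimed one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CharInfo:
    char: str
    pinyin: str
    tone: int
    structure: str
    radical: str


# Toy lexicon for illustration only (hypothetical data, not from the specification).
LEXICON = [
    CharInfo("嚼", "jiao", 2, "left-right", "口"),
    CharInfo("叫", "jiao", 4, "left-right", "口"),
    CharInfo("胶", "jiao", 1, "left-right", "月"),
    CharInfo("蕉", "jiao", 1, "top-bottom", "艹"),
    CharInfo("饺", "jiao", 3, "left-right", "饣"),
]


def match_by_priority(clues, priority, lexicon=LEXICON):
    """Narrow the candidates by applying each clue in priority order.

    clues    -- dict mapping a CharInfo field name to the value given by the user
    priority -- list of field names, highest priority first
    A clue that would remove every remaining candidate is skipped (assumed policy),
    since uncertainty information may itself be wrong.
    """
    candidates = list(lexicon)
    for kind in priority:
        if kind not in clues:
            continue
        narrowed = [c for c in candidates if getattr(c, kind) == clues[kind]]
        if narrowed:
            candidates = narrowed
    return [c.char for c in candidates]


# A user who trusts pronunciation puts pinyin and tone first.
result = match_by_priority(
    {"pinyin": "jiao", "tone": 2, "structure": "left-right"},
    ["pinyin", "tone", "structure"],
)
```

A user who is more confident about the glyph could pass a list such as `["radical", "structure", "tone"]` instead; the filtering code stays the same and only the order of application changes.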

Input examples and the processing flow thereof under different types of solutions are illustrated below.

Scenario: a user, who wants to input a Chinese text including four Chinese characters “

” on a handwriting input interface, forgets how to exactly write the character “

”.

Solution 1: the input scene of the user may be as follows:

input scene: “

jiao

”.

Solution 2: the input scene of the user may be as follows:

input scene: “

jiao

”, and it is also displayed that the input “jiao” is pronounced at the second tone.

Solution 3: the input scene of the user may be as follows:

input scene: “

jiao

”, and it is also displayed that the character to be input is left-right structured.

Solution 4: the input scene of the user may be as follows:

input scene: “

jiao

”, and it is also displayed that the character to be input is left-right structured and the input “jiao” is pronounced at the second tone.

Solution 5: the input scene of the user may be as follows:

input scene: “

”, and it is also displayed that the input “

” is pronounced at the second tone.

Solution 6: the input scene of the user may be as follows:

input scene: “

”, and it is also displayed that the input character “

” is left-right structured and is pronounced at the second tone.

Solution 7: the input scene of the user may be as follows:

input scene: “

”, and it is also displayed that the input character “

” is left-right structured.

Solution 8: the input scene of the user may be as follows:

input scene: “

”, and it is also displayed that the missing character is left-right structured and has a glyph “

” in its right portion.

In certain cases, a user may forget how to write more than one character from a phrase to be input. Again, taking the phrase “

” as an example.

Solution 9: the input scene of the user may be as follows:

input scene: “

jiao la”.

Under the various input scenes illustrated above, when calligraphy data is input in a certain format, the processing flow may be as follows:

Step 1: The calligraphy points input by the user are received by a receiving module, and transmitted to a preprocessing module for necessary preprocessing to obtain better recognition results.

Step 2: A boundary for each character is determined by a segmenting module.

Step 3: Calligraphy points pertaining to uncertainty information are recognized from the preprocessed calligraphy data points, and then separated from other calligraphy points pertaining to the Chinese text, by an uncertainty information detecting module.

Step 4: At least one type of uncertainty information is separated from the calligraphy points pertaining to uncertainty information as determined in step 3, by a phonetic alphabet character string detecting module/phonetic alphabet tone detecting module/structural configuration information detecting module/radical detecting module/homophone detecting module.

Step 5: The information that is separated in step 4 is recognized by a phonetic alphabet recognizing module/phonetic alphabet tone recognizing module/structural configuration category recognizing module/radical recognizing module/homophone recognizing module (i.e., Chinese character recognizing module).

Step 6: The calligraphy points pertaining to the Chinese text that are separated in step 3 are recognized to obtain candidate Chinese characters.

Step 7: The recognition results of the homophone recognizing module are converted into phonetic alphabet strings by a Chinese character-to-phonetic alphabet converting module.

Step 8: The phonetic alphabet strings that are recognized in step 5 and the phonetic alphabet strings that are converted in step 7 are processed by a phonetic alphabet plus tone-to-character converting module, so as to be converted into candidate Chinese characters. If the tone information is missing, this step performs a phonetic alphabet-to-character conversion instead.

Step 9: The structural configuration information that is recognized in step 5 is converted into candidate Chinese characters by a structural configuration category-to-character converting module.

Step 10: The Chinese characters that are obtained from step 8 and step 9 are screened according to the recognition results of the radical recognizing module, and re-scored and sorted by a character screening and merging module, in order to obtain a group of candidate Chinese characters.

Step 11: The candidate Chinese characters obtained from step 10 are combined with the candidate contexts obtained from step 6 to constitute phrases; these phrases are sorted from high to low in terms of scores calculated through language models, so as to output candidate phrases/sentences.

Step 12: The results are output.
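The conversion and screening stages of this flow (steps 8 to 10) can be sketched with plain table lookups. This is a rough illustration only: the three lookup tables hold hypothetical toy data, the function names are invented for the example, and the scoring rule (one weight per information source that proposes a character) is an assumption, since the specification does not fix a particular weighting.

```python
# Hypothetical toy tables; a real system would load these from its storage module.
PINYIN_TONE_TO_CHARS = {
    ("jiao", 1): ["胶", "蕉"],
    ("jiao", 2): ["嚼"],
    ("jiao", 3): ["饺"],
    ("jiao", 4): ["叫"],
}
STRUCTURE_TO_CHARS = {
    "left-right": ["嚼", "叫", "胶", "饺"],
    "top-bottom": ["蕉"],
}
RADICAL_OF = {"嚼": "口", "叫": "口", "胶": "月", "饺": "饣", "蕉": "艹"}


def chars_from_pinyin(pinyin, tone=None):
    """Step 8: pinyin plus tone to characters; falls back to pinyin only if tone is missing."""
    if tone is not None:
        return list(PINYIN_TONE_TO_CHARS.get((pinyin, tone), []))
    return [ch for (p, _), chars in PINYIN_TONE_TO_CHARS.items() if p == pinyin for ch in chars]


def chars_from_structure(structure):
    """Step 9: structural configuration category to characters."""
    return list(STRUCTURE_TO_CHARS.get(structure, []))


def screen_and_merge(candidate_sets, radical=None, weights=None):
    """Step 10: screen by radical (if recognized), then score and sort.

    Each character earns the weight of every source that proposes it (assumed rule).
    Ties keep the order in which characters were first proposed.
    """
    weights = weights or [1.0] * len(candidate_sets)
    scores = {}
    for chars, weight in zip(candidate_sets, weights):
        for ch in chars:
            if radical is not None and RADICAL_OF.get(ch) != radical:
                continue
            scores[ch] = scores.get(ch, 0.0) + weight
    return sorted(scores, key=lambda ch: -scores[ch])


# Pinyin "jiao" at the second tone plus a left-right structure.
with_tone = screen_and_merge([chars_from_pinyin("jiao", 2), chars_from_structure("left-right")])
# Pinyin without tone, left-right structure, and a recognized radical.
with_radical = screen_and_merge(
    [chars_from_pinyin("jiao"), chars_from_structure("left-right")], radical="口"
)
```

In this sketch the character proposed by both sources ranks first, and the radical acts as a hard filter rather than a score, mirroring the "screened according to the recognition results of the radical recognizing module" wording of step 10.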

In case that the available information for each character location contains at least either phonetic alphabet or a Chinese character, as shown in solutions 1-7 above, another processing method is to convert all the input information into phonetic alphabets and Chinese characters, then to recognize and score the character strings under both a phonetic alphabet model and a Chinese character model, and finally to merge the respectively obtained candidate characters into a uniform candidate sequence for output. The particular processing flow may be as follows:

Step 21: Calligraphy points input by the user are received by the receiving module and transmitted to the preprocessing module for necessary preprocessing to obtain better recognition results.

Step 22: A boundary for each character is determined by the segmenting module.

Step 23: Each character obtained from step 22 is converted into a Chinese character. If the character location itself is input as a Chinese character, the candidate Chinese characters are sorted by confidence coefficient after recognition. If the character location itself is input as phonetic alphabet, candidate Chinese characters are generated based on recognition results of the phonetic alphabet string. The conversion results shall ensure that each character location corresponds to at least one candidate Chinese character.

Step 24: The results of step 23 are merged and spliced by the character screening and merging module and a scoring function of a language-model scoring module for the Chinese character language model, so as to obtain candidate phrases.

Step 25: Each character in the results obtained from step 22 is converted into phonetic alphabet. If the character location itself is input as phonetic alphabet, the recognized phonetic alphabet string is output. If the character location itself is input as a Chinese character, regardless of whether this Chinese character is an exactly right one or a homophone, the recognized candidate Chinese character is converted into a candidate phonetic alphabet string by the Chinese character-to-phonetic alphabet converting module. The conversion results shall ensure that each character location corresponds to at least one candidate Chinese character.

Step 26: The results of step 25 are merged and spliced by the scoring function of the language-model scoring module for the phonetic alphabet string model, so as to obtain candidate phrases.

Step 27: The results of step 24 and step 26 are merged by a candidate screening and merging module according to merging rules as follows:

A. if both sequences contain the same candidate phrase, moving this candidate phrase forward;

B. counting the occurrence frequency of characters for each candidate phrase, and if a candidate phrase contains more characters with higher occurrence frequency than others, moving this candidate phrase forward;

C. merging two candidate phrases from two candidate sequences into one candidate sequence, according to respective scores of the candidate phrases.

Step 28: The merging results are output.
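Merging rules A to C can be read as a three-level sort key. The sketch below is one possible interpretation, not the specified algorithm: the function name `merge_candidates`, the use of the average character frequency for rule B, and the placeholder phrases made of Latin letters are all assumptions introduced for illustration.

```python
from collections import Counter


def merge_candidates(seq_a, seq_b):
    """Merge two candidate sequences of (phrase, score) pairs into one ranked list.

    Rule A: phrases present in both sequences come first.
    Rule B: among the rest, phrases whose characters occur more often across
            all candidates are preferred (average frequency, an assumed measure).
    Rule C: remaining ties are resolved by the best score of the phrase.
    """
    best = {}
    for phrase, score in list(seq_a) + list(seq_b):
        best[phrase] = max(score, best.get(phrase, float("-inf")))

    common = {p for p, _ in seq_a} & {p for p, _ in seq_b}
    freq = Counter(ch for phrase in best for ch in phrase)

    def sort_key(phrase):
        avg_freq = sum(freq[ch] for ch in phrase) / (len(phrase) or 1)
        return (phrase not in common, -avg_freq, -best[phrase])

    return sorted(best, key=sort_key)


# Placeholder phrases; each letter stands for one Chinese character.
chinese_model = [("ABCD", 0.9), ("ABED", 0.5)]
pinyin_model = [("ABCD", 0.8), ("XYZW", 0.7)]
merged = merge_candidates(chinese_model, pinyin_model)
```

Here "ABCD" leads because both models propose it, and "ABED" precedes "XYZW" despite its lower score because its characters recur in other candidates. Whether rule B should outrank rule C in this way is a design choice that the text leaves open.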

FIG. 4 is a flowchart of solution 1 of a method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention. As shown in FIG. 4, the flow of solution 1 of the method comprises steps of:

receiving calligraphy points input by the user;

performing necessary preprocessing on the calligraphy points, in order to obtain better recognition results;

determining a boundary for each character by segmenting;

extracting the calligraphy points pertaining to uncertainty information from the preprocessed calligraphy data points, and then separating the calligraphy points pertaining to uncertainty information from other calligraphy points pertaining to the Chinese text;

detecting the uncertainty information; if any uncertainty information is detected, performing the corresponding operation, including: further separating the uncertainty information from the obtained calligraphy points pertaining to uncertainty information, by means of phonetic alphabet character information; recognizing the separated information that does not belong to the Chinese text by the phonetic alphabet recognizing module; converting the recognized phonetic alphabet character string into candidate Chinese characters, by means of a phonetic alphabet-to-character conversion; sending the candidate Chinese characters into a phonetic alphabet string language model to calculate the accuracy rate of the merged phrases by scoring the language model according to the counted Chinese phrases/phonetic alphabet strings and the occurrence frequency thereof; merging the recognition results, and outputting the merging result;

if no uncertainty information is detected, recognizing the Chinese characters, i.e., recognizing Chinese-text characters from which the uncertainty information has been separated and recognizing the homophones in the uncertainty information; sending the recognition results into the Chinese character language model to calculate the accuracy rate of the merged phrases by scoring the language model according to the counted Chinese phrases/phonetic alphabet strings and the occurrence frequency thereof; merging the recognition results, and outputting the merging result.

Solution 1, taking the phonetic alphabet string as the one and only uncertainty information, is the simplest among the nine solutions. It is also the only solution available in the existing input mode for the problem of forgetting how to exactly write a Chinese character.

FIG. 5 is a flowchart of solution 4 of a method for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention. As shown in FIG. 5, the processing flow of solution 4 of the method comprises the steps described below.

Solution 4 is a relatively complex one among the nine solutions above, comprising three types of uncertainty information: the phonetic alphabet string, the phonetic alphabet tone and the structural configuration. It shares the processing flow with solution 1 as shown in FIG. 4; the only difference is that the detected phonetic alphabet tone information and structural configuration information are incorporated into the uncertainty information, which already contains the phonetic alphabet string. Thus it is unnecessary to go into details herein.

Due to the incorporation of the other two types of uncertainty information, a process to screen and merge the characters is introduced, comprising: sorting the candidate sequences which are obtained from different process modes on different bases (phonetic alphabet or Chinese character) under the same input, in order to generate a uniform candidate sequence.

Compared with solution 1, solution 4 involves more types of uncertainty information. As a result, it can narrow the collection of characters determined from the uncertainty information as far as possible, reduce the number of phrases constituted by said characters and the context, and thus reduce the computation time for scoring the language model, thereby improving the calculation accuracy and the overall system performance.
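The effect of a smaller character collection on the language-model workload can be made concrete: the number of phrases to score is the product of the candidate counts at each character location. The short sketch below only illustrates this arithmetic; the function name and the placeholder letters (each standing for one Chinese character) are hypothetical.

```python
from itertools import product


def enumerate_phrases(candidates_per_position):
    """Build every phrase that can be formed from per-position candidate lists."""
    return ["".join(chars) for chars in product(*candidates_per_position)]


# Pinyin only (as in solution 1): several homophones remain at the third position.
loose = enumerate_phrases([["A"], ["B"], ["C", "E", "F", "G"], ["D"]])
# Pinyin, tone and structure (as in solution 4): a single candidate survives.
tight = enumerate_phrases([["A"], ["B"], ["C"], ["D"]])
```

With two uncertain locations the counts multiply, so each extra piece of uncertainty information that removes candidates at one location reduces the scoring work for the whole phrase.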

In the embodiment above, input information is first received from a user, at least two types of uncertainty information of the Chinese characters to be input are extracted from the input information, and the matched Chinese characters are determined according to the at least two types of uncertainty information and output. This allows a user who has only incomplete memory of the pronunciation or glyph information of the Chinese character to be input to realize correct input of the Chinese character, by defining a certain range for candidate characters corresponding to the Chinese character to be input in combination with at least two types of extracted uncertainty information, wherein the uncertainty information includes, for example, phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information.

The embodiments of the present invention, in another aspect, provide a method for realizing Chinese character input based on uncertainty information, comprising steps of:

receiving input information from a user;

extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of the following information:

tone information, structural configuration information, radical information, homophone information, and location information; and,

determining and outputting the Chinese character to be input based on the determined Chinese characters and the uncertainty information.

In an embodiment of the present invention, the Chinese character to be input can be correctly and quickly determined by means of the predetermined Chinese characters, such as the context content of the Chinese character to be input, in combination with one or more types of uncertainty information including tone information, structural configuration information, radical information, homophone information, or location information. That is to say, the solution as provided by the present embodiment can realize Chinese character input even based on only one type of uncertainty information.

Specifically, for example, when it is expected to input four Chinese characters “

”, if the user has determined the Chinese characters “

” and “

” from the phrase but forgets how to exactly write the Chinese character “

”, he/she can easily determine the forgotten one is “

” by combining with any one or more types of uncertainty information thereof, including tone information, structural configuration information, radical information, homophone information, or location information; for example, the phonetic alphabet for this Chinese character is pronounced at the second tone, or, this Chinese character is left-right structured, or, this Chinese character has a homophone written as “

” among others, or, this Chinese character occupies the third position in the phrase. In other words, the method can correctly and quickly output the Chinese character “

” to be input, based on the predetermined Chinese characters (the first and second ones arranged in an order of “

”, and the fourth one “

”) in combination with the uncertainty information (the phonetic alphabet for this Chinese character is pronounced at the second tone, or, this Chinese character is left-right structured, or, this Chinese character has a homophone written as “

” among others, or, this Chinese character occupies the third position in the phrase).

In the present application scenario, in comparison with the foregoing solutions in which the Chinese character is matched and output based on at least two types of uncertainty information, the present solution takes full advantage of the predetermined context content of the Chinese character to be input, by a step of receiving input information from a user, a step of extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of tone information, structural configuration information, radical information, homophone information, and location information, and a step of determining and outputting the Chinese character to be input based on the determined Chinese characters and the uncertainty information, so as to correctly and quickly determine and output the Chinese character to be input simply based on any one type of tone information, structural configuration information, radical information, homophone information, or location information.

The process of determining and outputting the Chinese character to be input based on the predetermined Chinese characters and the uncertainty information in the present embodiment is similar to the process of determining and outputting the matched Chinese character based on at least two types of uncertainty information in the foregoing embodiments, which has been described in detail above. The foregoing embodiments of determining and outputting the matched Chinese character based on at least two types of uncertainty information are therefore also applicable to the solution of the present embodiment. It is therefore unnecessary to go into details herein, and reference may be made to the corresponding description.

FIG. 6 is a structural diagram illustrating a device 600 for realizing Chinese character input based on uncertainty information according to an embodiment of the present invention. As shown in FIG. 6, the device 600 for realizing Chinese character input based on uncertainty information in the present embodiment comprises a receiving module 610, an extracting module 620 and a matching module 630.

The receiving module 610 is configured for receiving input information from a user.

As an embodiment of the present invention, the receiving module 610 receives input information from a user by steps of:

detecting a user operation within an input area; and

recording at least two types of uncertainty information input by the user operation.

Specifically, the receiving module 610 is configured for receiving the tone information by any one of the following ways:

receiving tone symbols input by a user through handwriting corresponding to the tone information, the tone symbols including ˉ, ´, ˇ or ˋ;

receiving a certain number or letter indicative of the tone information.
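The two ways of receiving tone information can be normalized to a single tone number after recognition. The following sketch assumes that the handwriting recognizer has already turned the strokes into text; the function name `parse_pinyin_with_tone` and the convention that the tone indicator trails the syllable are assumptions made for illustration, and letter-coded tones are left out because the text does not say which letters would be used.

```python
# Recognized tone symbols mapped to tone numbers. Both the spacing modifier
# letters and their common look-alikes are accepted.
TONE_MARKS = {"ˉ": 1, "¯": 1, "ˊ": 2, "´": 2, "ˇ": 3, "ˋ": 4, "`": 4}


def parse_pinyin_with_tone(text):
    """Split recognized text into (pinyin, tone); tone is None when absent.

    Assumes the tone indicator, if any, is the last character of the text.
    """
    text = text.strip()
    if not text:
        return ("", None)
    last = text[-1]
    if last in TONE_MARKS:
        return (text[:-1].strip(), TONE_MARKS[last])
    if last in "1234":
        return (text[:-1].strip(), int(last))
    return (text, None)
```

For example, `parse_pinyin_with_tone("jiao2")` and `parse_pinyin_with_tone("jiao ´")` both yield the syllable "jiao" with the second tone, while a bare "jiao" yields no tone, in which case a pinyin-only lookup would apply.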

Specifically, the receiving module 610 is configured for receiving the location information by any one of the following ways:

receiving a blank or a certain character input by the user.

The extracting module 620 is configured for extracting at least two types of uncertainty information of Chinese characters to be input, from the input information.

Specifically, the uncertainty information extracted by the extracting module 620 includes any one or more types of the following information:

phonetic alphabet information, tone information, structural configuration information, radical information, homophone information, and location information. Wherein, the phonetic alphabet information includes: phonetic alphabet character string information for separating phonetic alphabet strings according to calligraphy characteristic and writing area of handwriting; and, phonetic alphabet tone information for separating phonetic alphabet tones according to calligraphy characteristic and writing area of the handwriting of phonetic alphabet tone. The structural configuration information is used for separating the structural configuration information according to calligraphy characteristic and writing area of the structural configuration information. The radical information is used for separating radicals which can be determined by the user according to calligraphy characteristic and writing area of the structural configuration information. The homophone information is used for separating the homophone information according to calligraphy characteristic and writing area of the handwriting of homophone.

The matching module 630 is configured for determining a matched Chinese character according to the at least two types of uncertainty information and outputting the matched Chinese character.

Specifically, the matching module 630 determines a matched Chinese character according to the at least two types of uncertainty information and outputs the matched Chinese character by steps of:

selecting corresponding models according to the at least two types of uncertainty information;

matching the Chinese characters to be input in at least two types of corresponding models;

analyzing matching results for each of the at least two types of corresponding models, determining Chinese characters with a highest matching degree as the Chinese characters to be input; and,

outputting the Chinese characters with a highest matching degree.

Specifically, the matching module 630 is further configured for:

determining a priority for the uncertainty information; and

matching the Chinese characters to be input in a sequence according to the priority, in the corresponding model.

In the embodiment mentioned above, input information is received from a user, at least two types of uncertainty information of the Chinese characters to be input are extracted from the input information, and the matched Chinese characters are determined according to the at least two types of uncertainty information and output. This allows a user who has only incomplete memory of the pronunciation or glyph information of the Chinese characters to be input to realize correct input of the Chinese characters, by defining a certain range for candidate characters corresponding to the Chinese characters to be input in combination with at least two types of extracted uncertainty information, wherein the uncertainty information includes, for example, phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information.

FIG. 7 is a structural diagram illustrating a device for realizing Chinese character input based on uncertainty information according to another embodiment of the present invention. As shown in FIG. 7, in the present embodiment, the device for realizing Chinese character input based on uncertainty information further comprises: a setting module 640, an updating module 650 and a storing module 660.

As an embodiment of the above device for realizing Chinese character input based on uncertainty information, the setting module 640 is configured for receiving input information from a user and setting a priority for the uncertainty information.

Further, as an embodiment of the above device for realizing Chinese character input based on uncertainty information, the updating module 650 is configured for displaying the updated uncertainty information on the screen when detecting a modification of the uncertainty information by the user within an input area.

Further, the storing module 660 is configured for storing input information of the user, including, for example, calligraphy points input by a user through handwriting, language models, a mapping table between phonetic alphabet and Chinese character, as well as a recognition library of Chinese characters and phonetic alphabet.

Additionally, the processor contained in the device according to the present embodiment further provides the following functions:

performing necessary preprocessing procedures on the information which is input by the user and received by the receiving module 610, including: preprocessing, denoising, and tilt correction, etc.;

recognizing a boundary for each character by segmenting;

distinguishing the uncertainty information from the Chinese text in the information input by the user, according to the format of the uncertainty information extracted by the extracting module 620;

obtaining phonetic alphabet character strings according to the calligraphy of phonetic alphabet, so as to recognize the phonetic alphabet calligraphy information;

obtaining the structural-configuration category according to the calligraphy of structural configuration, so as to recognize the structural configuration information;

recognizing the phonetic alphabet tones, the radicals and the homophones;

recognizing the characters in the Chinese text from which the uncertainty information has been separated, and recognizing the homophones in the uncertainty information, so as to recognize corresponding Chinese characters;

based on recognition results of Chinese character, obtaining corresponding phonetic alphabet strings of the recognized Chinese character by means of a “Chinese character-to-phonetic alphabet” conversion through table lookup;

based on a mapping table between the combined information of phonetic alphabet string with tone (phonetic alphabet string+tone) and the Chinese character, obtaining conversion results of a “phonetic alphabet plus tone-to-character” conversion through table lookup;

based on recognition results of structural configuration category, mapping the recognized structural configuration category into a character collection corresponding to the category by means of a “structural configuration category-to-character” conversion through table lookup;

a process of screening and merging characters, comprising: screening (according to recognition results of radical, if any) a character collection derived from the uncertainty information of different types (phonetic alphabet, structural configuration, homophone), and merging the screened characters (according to a group of scores sorted from high to low, which is obtained by scoring in terms of weights), so as to screen and sort the candidate characters obtained based on different types of uncertainty information;

a process of screening and merging the candidate characters, comprising: sorting more than one candidate sequence obtained on different bases, according to certain rules, and merging these sorted ones into a candidate sequence for final output;

a process of scoring the language model, comprising a process of scoring the Chinese character language model and a process of scoring the phonetic alphabet string language model, which provide the following functions: combining the candidate Chinese character corresponding to the phonetic alphabet, or the candidate phonetic alphabet string corresponding to the Chinese character, with the Chinese character or phonetic alphabet string of the context; scoring different combinations in this module such that the combination with a higher score has a higher ranking, so as to calculate the accuracy rate of the spliced phrase according to the counted Chinese phrases/phonetic alphabet strings and the occurrence frequency thereof by model scoring.
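The language-model scoring function listed last can be illustrated with a frequency table of counted phrases. This is a deliberately simple sketch: the function name `rank_phrases`, the relative-frequency score and the placeholder letters (each standing for one Chinese character or one phonetic syllable) are assumptions for illustration, whereas a practical language model would typically be an n-gram or similar statistical model.

```python
def rank_phrases(context, candidates, phrase_counts):
    """Splice each candidate into the open slot of the context and rank the phrases.

    context       -- list of known characters with None marking the uncertain location
    candidates    -- candidate characters for that location
    phrase_counts -- counted phrases and their occurrence frequencies (toy data)
    Returns (phrase, score) pairs sorted from high to low score, where the score
    is the relative frequency of the spliced phrase.
    """
    total = sum(phrase_counts.values()) or 1
    slot = context.index(None)
    scored = []
    for cand in candidates:
        phrase = "".join(context[:slot] + [cand] + context[slot + 1:])
        scored.append((phrase, phrase_counts.get(phrase, 0) / total))
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical counts; the third location is the one the user could not write.
counts = {"ABCD": 12, "ABED": 3}
ranking = rank_phrases(["A", "B", None, "D"], ["E", "C", "F"], counts)
```

The same function serves both models described in the text: with Chinese characters as the units it plays the role of the Chinese character language model, and with syllables as the units it plays the role of the phonetic alphabet string language model.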

In the embodiment above, input information is received from a user, at least two types of uncertainty information of the Chinese characters to be input are extracted from the input information, and the matched Chinese characters are determined according to the at least two types of uncertainty information and output. This allows a user who has only incomplete memory of the pronunciation or glyph information of the Chinese characters to be input to realize correct input of the Chinese characters, by defining a certain range for candidate characters corresponding to the Chinese characters to be input in combination with at least two types of extracted uncertainty information, wherein the uncertainty information includes, for example, phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information.

The embodiments of the present invention, in another aspect, provide a device for realizing Chinese character input based on uncertainty information, comprising a receiving module, an extracting module and a matching module:

the receiving module is configured for receiving input information from a user;

the extracting module is configured for extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of the following information:

tone information, structural configuration information, radical information, homophone information, and location information;

the matching module is configured for determining and outputting the Chinese characters to be input based on the determined Chinese characters and the uncertainty information.

The process for the matching module to determine and output the Chinese characters to be input based on the determined Chinese characters and the uncertainty information in the present embodiment is similar to the process of determining and outputting the matched Chinese characters based on at least two types of uncertainty information in the foregoing embodiments of the method or device, which has been described in detail above. The foregoing embodiments of determining and outputting the matched Chinese characters based on at least two types of uncertainty information are therefore also applicable to the solution of determining and outputting the Chinese characters to be input based on the determined Chinese characters and any one or more types of the uncertainty information in the present embodiment. It is therefore unnecessary to go into details herein, and reference may be made to the corresponding description.

FIG. 8 is a schematic diagram illustrating a process of determining Chinese characters with the highest matching degree based on uncertainty information in a case where the available information for each character location at least includes either phonetic alphabet or Chinese character, comprising the processing flow described below.

Corresponding models are selected according to the at least two types of uncertainty information, of which the particular processing flow will be described with the following examples.

Input information “

jiao la” is received and converted into Chinese characters and phonetic alphabet strings, respectively, for the purpose of taking full advantage of the available information as far as possible; modeling analysis is performed on the input uncertainty information under the Chinese character model and the phonetic alphabet model, respectively, to obtain respective results; and the results are merged to improve the reliability of the results.

The Chinese characters or phonetic alphabet strings obtained byconversion during the above process are displayed.

Specifically, if the Chinese character language model is selected according to the at least two types of uncertainty information contained in “

jiao la”, the following processing flow is performed:

matching the Chinese characters “

jiao la” to be input in the Chinese character language model, the possible results including “

”, “

” and “

”; and inputting all of the information containing the above Chinese characters “

”, “

” and “

” into the Chinese character language model for matching, the possible combinations of Chinese characters including “

”, “

”, “

”, or “

”.

Specifically, if the phonetic alphabet string language model is selected according to the at least two types of uncertainty information contained in “

jiao la”, the following processing flow is performed:

matching the Chinese characters “

jiao la” to be input, in the phonetic alphabet string language model,

in which step, “

jiao la” may be converted into phonetic alphabet strings as follows: “wei tong jiao la”, “mei wen qiao” and “shu bao”;

inputting all of the information containing the above phonetic alphabet strings “wei tong jiao la”, “mei wen qiao” and “shu bao” into the phonetic alphabet string language model for matching,

the possible combinations of Chinese characters including “

”, “

”, “

”, and “

”.

The two candidate sequences are merged according to predefined rules. For example, it is determined whether the character strings obtained under the two models share any results; if so, the shared results are placed into the merged sequence. As shown in the example, if the two character strings both contain “

”, “

” is used as the first candidate character string of the merging result. Likewise, if the two character strings both contain “

”, “

” is used as the second candidate character string of the merging result. Subsequently, once the two character strings no longer share any result, the character strings that are contained in only one of the two sequences are sorted from high to low by score, according to the designed scoring rules. For example, the scores of the two models may be sorted directly, or the scores of the two models may be converted into unified scores by applying weighting parameters calculated from the model parameters, after which the remaining character strings are sorted uniformly based on the converted scores. The following merging results may be obtained: “

”, “

”, “

”, and “

”.
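The merging rule described above can be sketched as follows. This is only a minimal illustration under stated assumptions, not the implementation of the present invention: the function name `merge_candidates`, the fixed weights, the linear weighted sum used as the unified score, and the placeholder candidates "A" to "D" are all hypothetical. The description also does not state how shared results are ordered among themselves; ordering them by unified score is an assumption of this sketch.

```python
from typing import Dict, List


def merge_candidates(
    hanzi_scores: Dict[str, float],
    pinyin_scores: Dict[str, float],
    hanzi_weight: float = 0.5,
    pinyin_weight: float = 0.5,
) -> List[str]:
    """Merge the scored candidates of two models into one ranked list.

    Candidates found by both models come first; candidates found by only
    one model follow, sorted from high to low by a unified score.
    """

    def unified(candidate: str) -> float:
        # Assumed scoring rule: weighted sum, with a missing score as 0.
        return (hanzi_weight * hanzi_scores.get(candidate, 0.0)
                + pinyin_weight * pinyin_scores.get(candidate, 0.0))

    shared = [c for c in hanzi_scores if c in pinyin_scores]
    single = [c for c in hanzi_scores if c not in pinyin_scores]
    single += [c for c in pinyin_scores if c not in hanzi_scores]

    shared.sort(key=unified, reverse=True)
    single.sort(key=unified, reverse=True)
    return shared + single


# Placeholder candidates stand in for the candidate character strings.
hanzi_model = {"A": 0.9, "B": 0.6, "C": 0.4}
pinyin_model = {"B": 0.6, "A": 0.5, "D": 0.7}
merged = merge_candidates(hanzi_model, pinyin_model)
print(merged)  # → ['A', 'B', 'D', 'C']
```

Under these assumptions, the first element of the merged list corresponds to the candidate with the highest matching degree, and the weights could be tuned to favor whichever model is considered more reliable for a given input.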

The above matching results of the Chinese character language model and the phonetic alphabet string language model are analyzed to determine the Chinese characters with the highest matching degree as the Chinese characters to be input, and the Chinese characters with the highest matching degree, that is, “

”, are output.

As an embodiment of the present invention, in practical application, the foregoing device for realizing Chinese character input based on uncertainty information is usually implemented by a terminal device that includes the device for realizing Chinese character input based on uncertainty information as shown in FIG. 6 or FIG. 7.

The foregoing solutions provided by the present invention allow a user who has incomplete memory of the pronunciation or glyph information of the Chinese characters to be input to input the Chinese characters correctly, by defining a certain range of candidate characters corresponding to the Chinese characters to be input, in combination with at least two types of extracted uncertainty information of the Chinese characters to be input. In addition, the above solutions require only minor modifications to existing systems, and hence do not affect system compatibility. Moreover, the implementations of these solutions are both simple and highly effective.

It should be appreciated by a person skilled in the art that the present invention may involve devices for implementing one or more of the operations described herein. The device may be designed and manufactured for dedicated purposes as required, or may comprise well-known devices found in general-purpose computers, which are selectively activated or reconfigured by the programs stored therein. Such computer programs may be stored in device-readable (for example, computer-readable) media, or in any type of media suitable for storing electronic instructions and coupled to a bus. Such computer-readable media include, but are not limited to, any type of disk or disc (including floppy disk, hard disk, optical disk, CD-ROM and magneto-optical disk), read-only memory (ROM), random access memory (RAM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), flash memory, magnetic card or fiber card. That is to say, the readable media include any mechanism that stores or transmits information in a form readable by a device (for example, a computer).

It should be appreciated by a person skilled in the art that each block, as well as combinations of blocks, in the structural diagrams and/or block diagrams and/or flowcharts may be implemented through computer program instructions. It should further be appreciated that these computer program instructions may be provided to a general-purpose computer, a dedicated computer or another processor for programmable data processing, so as to produce a machine in which the methods specified in the block(s) of the structural diagrams and/or block diagrams and/or flowcharts are implemented through the instructions executed on the computer or the other processor for programmable data processing.

It should be appreciated by a person skilled in the art that the various operations, methods, steps in the flows, measures and schemes discussed in the present invention can be interchanged, modified, combined or deleted. Furthermore, other operations, methods, steps in the flows, measures and schemes involving the various operations, methods, steps in the flows, measures and schemes discussed in the present invention may also be interchanged, modified, rearranged, decomposed, combined or deleted. Furthermore, other operations, methods, steps in the flows, measures and schemes having the same functions as the various operations, methods, steps in the flows, measures and schemes discussed in the present invention may also be interchanged, modified, rearranged, decomposed, combined or deleted.

The description above illustrates only some of the embodiments of the present invention. It should be pointed out that various modifications and refinements may be made by a person skilled in the art without departing from the principle of the present invention. These modifications and refinements shall also be regarded as falling within the scope of protection of the present invention.

What is claimed is:
 1. A method for realizing Chinese character input based on uncertainty information, characterized in that, comprising: receiving input information from a user; extracting at least two types of uncertainty information of Chinese characters to be input, from the input information; and determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters.
 2. The method for realizing Chinese character input based on uncertainty information according to claim 1, characterized in that the uncertainty information includes any one type of the following information: phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information.
 3. The method for realizing Chinese character input based on uncertainty information according to claim 2, characterized in that the tone information is received by any one of the following ways: receiving tone symbols input by a user through handwriting corresponding to the tone information, the tone symbols including ˉ, ´, ˇ or `; and receiving a certain number or letter indicative of the tone information.
 4. The method for realizing Chinese character input based on uncertainty information according to claim 2, characterized in that the location information is received by any one of the following ways: receiving a blank or a certain character input by the user.
 5. The method for realizing Chinese character input based on uncertainty information according to any one of claims 2-4, characterized in that the step of receiving input information from a user comprises: detecting a user operation within an input area; recording at least two types of uncertainty information input by a user operation.
 6. The method for realizing Chinese character input based on uncertainty information according to claim 5, further comprising: displaying the at least two types of uncertainty information on the screen.
 7. The method for realizing Chinese character input based on uncertainty information according to claim 2, characterized in that the step of determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters comprises: selecting corresponding models according to the at least two types of uncertainty information; matching the Chinese characters to be input in the at least two types of corresponding models; and analyzing matching results for each of the at least two types of corresponding models to determine Chinese characters with a highest matching degree as the Chinese characters to be input, and outputting the Chinese characters with a highest matching degree.
 8. The method for realizing Chinese character input based on uncertainty information according to claim 7, further comprising: determining a priority for the uncertainty information; and matching the Chinese characters to be input in a sequence according to the priority, in the corresponding models.
 9. A method for realizing Chinese character input based on uncertainty information, comprising: receiving input information from a user; extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of the following information: tone information, structural configuration information, radical information, homophone information, and location information; and, determining and outputting the Chinese characters to be input based on the determined Chinese characters and the uncertainty information.
 10. A device for realizing Chinese character input based on uncertainty information, comprising a receiving module, an extracting module and a matching module: the receiving module is configured for receiving input information from a user; the extracting module is configured for extracting at least two types of uncertainty information of Chinese characters to be input, from the input information; and the matching module is configured for determining the matched Chinese characters according to the at least two types of uncertainty information and outputting the matched Chinese characters.
 11. The device for realizing Chinese character input based on uncertainty information according to claim 10, characterized in that the uncertainty information extracted by the extracting module includes any one type of the following information: phonetic alphabet (Pinyin) information, tone information, structural configuration information, radical information, homophone information, and location information.
 12. The device for realizing Chinese character input based on uncertainty information according to claim 11, characterized in that the receiving module is configured for receiving the tone information by any one of the following ways: receiving tone symbols input by a user through handwriting corresponding to the tone information, the tone symbols including ˉ, ´, ˇ or `; receiving a certain number or letter indicative of the tone information.
 13. The device for realizing Chinese character input based on uncertainty information according to claim 11, characterized in that the receiving module is configured for receiving the location information by any one of the following ways: receiving a blank or a certain character input by the user.
 14. The device for realizing Chinese character input based on uncertainty information according to any one of claims 11-13, characterized in that the receiving module receives the input information from the user by steps of: detecting a user operation within an input area; recording at least two types of uncertainty information input by the user operation.
 15. The device for realizing Chinese character input based on uncertainty information according to claim 14, characterized in that the receiving module is further configured for displaying the at least two types of uncertainty information on the screen.
 16. The device for realizing Chinese character input based on uncertainty information according to claim 15, further comprising an updating module, the updating module is configured for displaying the updated uncertainty information on the screen when detecting a modification of the uncertainty information by the user within an input area.
 17. The device for realizing Chinese character input based on uncertainty information according to claim 11, characterized in that the matching module determines the matched Chinese characters according to the at least two types of uncertainty information and outputs the matched Chinese characters by steps of: selecting corresponding models according to the at least two types of uncertainty information; matching the Chinese characters to be input in at least two types of corresponding models; and, analyzing matching results for each of the at least two types of corresponding models to determine Chinese characters with a highest matching degree as the Chinese characters to be input, and outputting the Chinese characters with a highest matching degree.
 18. The device for realizing Chinese character input based on uncertainty information according to claim 17, characterized in that the matching module is further configured for determining a priority for the uncertainty information; and matching the Chinese characters to be input in a sequence according to the priority, in the corresponding models.
 19. The device for realizing Chinese character input based on uncertainty information according to claim 9, further comprising a setting module, the setting module is configured for receiving input information from a user and setting a priority for the uncertainty information.
 20. A device for realizing Chinese character input based on uncertainty information, characterized in that, comprising a receiving module, an extracting module and a matching module: the receiving module is configured for receiving input information from a user; the extracting module is configured for extracting uncertainty information of Chinese characters to be input, from the input information, wherein the uncertainty information includes any one or more types of the following information: tone information, structural configuration information, radical information, homophone information, and location information; and, the matching module is configured for determining and outputting the Chinese characters to be input based on the determined Chinese characters and the uncertainty information.